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Abstract 

Algorithms based on semi-partitioned scheduling have been proposed as a viable alternative between the two 
extreme ones based on global and partitioned scheduling. In particular, allowing migration to occur only for few 
tasks which cannot be assigned to any individual processor, while most tasks are assigned to specific processors, 
considerably reduces the runtime overhead compared to global scheduling on the one hand, and improve both 
the schedulability and the system utilization factor compared to partitioned scheduling on the other hand. 

In this paper, we address the preemptive scheduling problem of hard real-time systems composed of sporadic 
constrained-deadline tasks upon identical multiprocessor platforms. We propose a new algorithm and a schedul- 
ing paradigm based on the concept of semi-partitioned scheduling with restricted migrations in which jobs are 
not allowed to migrate, but two subsequent jobs of a task can be assigned to different processors by following a 
periodic strategy. 



1 Introduction 

In this work, we consider the preemptive scheduling of hard real-time sporadic tasks upon identical multipro- 
cessors. Even though the Earliest Deadline First (EDF) algorithm |13 | turned out to be no longer optimal in 
terms of schedulability upon multiprocessor platforms |7|, many alternative algorithms based on this scheduling 
policy have been developed due to its optimality upon uniprocessor platforms 1 13|. Again, the primary focus for 
designers is at improving the worst-case system utilization factor with guaranteeing all tasks to meet deadlines. 
Unfortunately, most of the algorithms developed in the literature suffer from a trade-off between the theoretical 
schedulability and the practical overhead of the system at run-time: achieving high system utilization factor leads 
to complex computations. Up to now, solutions are still widely discussed. 

In recent years, multicore architectures, which include several processors upon a single chip, have been the pre- 
ferred platform for many embedded applications. This is because they have been considered as a solution to the 
"thermal roadblock" imposed by single-core designs. Most chip makers such as Intel, AMD and Sun have released 
dual-core chips, and a few designs with more than two cores have been released as well. However given such a 
platform, the question of scheduling hard real-time systems becomes more complicated, and thus, has received 
considerable attention |[3][8|. For such systems, most results have been derived under either global ox partitioned 
scheduling techniques. While either approach might be viable, each has serious drawbacks on this platform, and 
neither will likely utilize the system very well. 
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In global scheduling f6\, all tasks are stored in a single priority-ordered queue. The global scheduler selects for 
execution the highest priority tasks from this queue. Unfortunately, algorithms based on this technique may lead to 
runtime overheads that are prohibitive. This is due to the fact that tasks are allowed to migrate from one processor 
(CPU) to another in order to complete their executions and each migration cost may not be negligible. Moreover, 
Dhall et al. showed that global EDF may cause a deadline to be missed if the total utilization factor of a task set is 
slightly greater than 1 |7|. 

In partitioned scheduling |]5], tasks are first assigned statically to processors. Once the task assignment is done, 
each processor uses independently its local scheduler. Unfortunately, algorithms based on this technique may lead 
to task systems that are schedulable if and only if some tasks are not partitioned. Moreover, Lopez et al. showed 
that the total utilization factor of a schedulable system using this technique is at most 50%. 

These two scheduling techniques are incomparable — at least for priority driven schedules — , that is, there 
are systems which are schedulable with partitioning and not by global and reversely. It thus follows that the 
effectiveness of a scheduling algorithm depends not only on its runtime overhead, but also its abihty to schedule 
feasible task systems. 

Recent work p pOpTj came up with a novel and promising technique called semi-partitioned scheduling with the 
main objectives of reducing the runtime overhead and improving both the schedulability and the system utilization 
factor. In semi-partitioned scheduling techniques, most tasks, called non-migrating tasks, are fixed to specific 
processors in order to reduce runtime overhead, while few tasks, called migrating tasks, migrate across processors 
in order to improve both the schedulability and the system utilization factor. However, the migration costs depend 
on the instant each migrating task is migrated during execution: migrating a task during the execution of one of 
its jobs is more time consuming than migrating it at the instant of its activation. As such, we can distinguish 
between two levels of migration: job migration where a job is allowed to execute on different processors |2| and 
task migration where task migration is allowed, but job migration is forbidden. Between the two techniques, the 
task migration is the one which minimizes the migration costs. To the best of our knowledge, only one solution 
using the latter semi-partitioned scheduling technique was established but it is only applicable to soft real-time 
systems [ 1 1. 

In this paper, we study the scheduling of hard real-time systems composed of sporadic constrained-deadline tasks 
upon identical multiprocessor platforms. For such platforms, all the processors have the same computing capaci- 
ties. 

Related work. On the road to solve the above mentioned problem, sound results using semi-partitioned schedul- 
ing techniques have been obtained by S. Kato in terms of both schedulability and system utilization factor. How- 
ever, these results are all based on "job-splitting" strategies 1 1 1 1, and consequently, may still lead to a prohibitive 
runtime overheads for the system. Indeed, the main idea of S. Kato consists in using a "job-splitting" (i.e., job 
migration is allowed) strategy based on a specific algorithm for tasks which cannot be scheduled by following the 
First Fit Decreasing (FFD) algorithm 1 12|. Figure [T] illustrates that the entire portion of job is not partitioned 
but it is split into z^^i, Tjt_2 and z^j which are partitioned upon CPUs Ki, K2 and 713, respectively. The amount of 
each share is such a value that fills the assigning processor to capacity without timing violations. 

Due to such a "job-splitting" strategy, it is necessary to have a mechanism which ensure that each job cannot 
be executed in parallel upon different processors. Such a mechanism has been provided by the introduction of 
execution windows^. Using this mechanism, each share can execute only during its execution window. Hence, 
multiple executions of the same job upon several processors at the same time are prohibited by guaranteeing that 
the execution windows do not overlap |^12|. 

This research. In this paper, we propose a new algorithm and a scheduling paradigm based on the concept of 
semi-partitioned scheduling with restricted migrations: jobs are not allowed to migrate, but two subsequent jobs 
of a task can be assigned to different processors by following a periodic strategy. Our intuitive idea is to limit the 
runtime overhead as it may still be prohibitive for the system when using a job-splitting strategy. The algorithm 
starts with a classical step where tasks are partitioned among processors using the FFD algorithm. Then, for the 
remaining tasks (i.e., those whose all the jobs cannot be assigned to one processor by following FFD without 
exceeding its capacity), a semi-partitioning scheduling technique with restricted migrations is used by following 

^An execution window for a job is a time span during it is allowed to execute. 
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Figure 1: S. Kato's algorithm p2) . 



a periodic strategy. 

Paper organization. The remainder of this paper is structured as follows. Section |2] presents the task model 
and the platform that are used throughout the paper. Section|3]provides the principles of the proposed algorithm. 
Section|4]elaborates the conditions under which the system is schedulable. Section|5]presents experimental results. 
Finally, Section|6]concludes the paper and proposes future work. 

2 System model 

def 

We consider the preemptive scheduling of a hard real-time system T = {11,12, ■■■ comprised of n tasks upon 
m identical processors. In this case, all the processors have the same computing capacities. The k''' processor 
is denoted Kk. Each task T,- is a sporadic constrained-deadline task characterized by three parameters {Ci,Di,Ti) 
where C,- is the Worst Case Execution Time (WCET), D; < 7] is the relative deadline and 7/ is the minimum inter- 
arrival time between two consecutive releases of T,. These parameters are given with the interpretation that task T, 
generates an infinite number of successive jobs ijj with execution requirement of at most C; each, arriving at time 
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Oij such that —Oij > Tj and that must completes within [aij,di,j) where dij = ai j +Di. 

def 

The utilization factor of task T, is defined as m,- — Ci/Ti and the total utilization factor of task system i is defined 

def 

as t/sum('c) = L/Li A task system is said lo fully utilize the available processing capacity if its total utilization 
factor equals the number of processors (m). The maximum utilization factor of any task in i is denoted Mmax('c)- 
A task system is preemptive if the execution of its job may be interrupted and resumed later In this paper, we 
consider preemptive scheduling poUcies and we place no constraints on total utilization factor. 

We consider that tasks are scheduled by following the Earliest Deadline First (EDF) scheduler with restricted 
migrations. That is, on the one hand, the shorter the deadline of a job the higher its priority. On the other hand, 
tasks can migrate from one processor to another but when a job has started upon a given processor then it must 
complete without any migration. Such a strategy is a compromise between task partitioning and global scheduling 
strategies. 

We assume that all the tasks are independent, that is, there is no communication, no precedence constraint and no 
shared resource (except for the processors) between tasks. We assume that the jobs of a task cannot be executed 
in parallel, that is, for any / and j, Zij cannot be executed in parallel on more than one processor. Moreover, this 
research assumes that the costs of preempting and migrating tasks are included in the WCET, which makes sense 
since we limit migrations at task arrival instants and preemptions at task arrival or completion instants (EDF is the 
local scheduler). 

3 Principle of the algorithm 

In this section, we provide to the reader the main steps of our algorithm. Its design is based on the concept of 
semi-partitioned scheduling with restricted migrations and consists of two phases: 

(1) The assigning phase: here, each non-migrating task is assigned to a specific CPU by following the FED algo- 
rithm and the jobs of each migrating tasks are assigned to CPUs by following a periodic strategy. 

(2) The scheduling phase: here, each migrating task is modeled upon each CPU as a multiframe task and thus the 
schedulability analysis focuses on each CPU individually, using the rich and extensive results from the uniproces- 
sor scheduling theory. 

3.1 The assigning phase 

In this phase, each task Xk is assigned to a particular processor Kj by following the FED algorithm, as long as the 
task does not cause the system not to be schedulable upon %j. That is, Load(T'^j ) ='^sup(,>Qj dbf(t j ,t) ^ where 
Z^j denotes the subset of tasks assigned to CPU Kj and DBF is the classical Demand Bound Function Q. Such a 
task is classified into a non-migrating task. Now, if Load(T'^J ) > 1 ^ that is, all the jobs of a task cannot be assigned 
to the same processor, then the task is classified into a migrating task. The jobs are assigned to several processors 
by following a periodic strategy and are executed upon these by using a semi -partitioned scheduling with restricted 
migrations. Details on this strategy, obviously chosen for sake of simplicity during the implementations, are 
provided in the next section. 

Example. In order to illustrate this strategy for a system with a single migrating task task T, — {Ci,Di,Ti), 
we assume that T, has been assigned to a set of CPUs containing at least Ki and 712 with the periodic sequence 

O (7ti,7t2,7ti)- This means, the first job of x, will be assigned to 7ti, the second one to 712, the third one to 
processor Ki and from the fourth job of T,, this very same process will repeat cyclically upon processors Tti and 
712. Figure |2] depicts this job assignment. 

From the schedulability point of view, task T, will be duplicated upon CPUs Ki and K2 and according to the 
scheduling phase, it will be seen as two multiframe tasks ^14J . These multiframe tasks will be denoted by x' and 

T? and will have the following parameters: t/ = ((Ci,0,C,), A', 7i) and T? = ((0,C,,0),D,, Tj). As such, we will be 
able to perform the schedulability analysis upon each CPU individually, using uniprocessor approaches, already 
well understood. 
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Figure 2: Periodic job assignment. 



3.2 Periodic job assignment strategy 



Since each task consists of an infinite number of jobs, a potentially infinite number of periodic job assignment 
strategies can be defined for assigning migrating tasks to the CPUs. In this section, we propose two algorithms to 
achieve this: The most regular-job pattern algorithm and the alternative-job pattern algorithm. 

In this paper, since each migrating task is modeled as a multiframe task upon each CPU it has at least one job 
assigned to, we assume that the number of frames obtained from each migrating task is equal to a non-negative 
integer K which is known beforehand for sake of readability. This assumption will be relaxed in future work. 

Letting A[l • • n, 1 • • m] be an matrix of integers where the first index refers to tasks and the second index to 

def 

processors, we set the value A[;,j] = x (with 1 < / < n and I < j < m) to indicate that x jobs among K consecutive 



jobs of task t, will be executed upon processor %j. In Section 3.3 we provide the reader with details on two 
algorithms used to initialize matrix A [ 1 • • • « , 1 • • • m] , while guaranteeing for every task T, that j „, A [/ , j] ~K. 

Before going any further in this paper, it is worth noticing that if K — 1, then task migrations are forbidden. Thus, 
our model extends the classical partitioning scheduling model. If the A[i,j] are known beforehand, then the most 
regular-job pattern algorithm is considered, otherwise we consider the alternative-job pattern algorithm. 



Most regular-job pattern algorithm 

A uniform assignment of jobs of each migrating task x, among the subset of CPUs 5, upon which T, will execute 
at least one job seems to be a good idea at first glance (but we have no theoretical result proving that). For this 
reason we introduce the principle laying behind such a strategy |'9'|. In Si, we assume that the CPUs are ranged in 
a non-decreasing index-order. If Si = 0, then the system is clearly not schedulable. 

The job assignment is performed according to the following two steps for task T, . 



Step 1. The matrix A is computed thanks to Algorithm [3] (see Section 3.3 for details) such that A[/,y] jobs 
among K consecutive jobs of task T,- will be executed upon processor 7iy and such that Y.i=i m-^l^J] = 

Step 2. The assignment sequence a of task T,- is defined through K sub-sequences 



a = (oo,Oi,...,af,...,OA:_i) 



(1) 



where the (£+1)"' sub -sequence (with £ = 0, . . . ,K — I) is given in turn by the following m-tuple: 



def / 1 9 n 

Oi = (a^,0£,...,o^ 



(2) 



To define sub-sequence ac using the uniform assignment pattern, there is at most one job per CPU each time 
w.r.t. Equation^] The job of T, will be assigned to Tij if and only if: 



;■ def 



= 1 



(3) 



The goal is to assign A[i,j] jobs among K consecutive jobs of task T, to CPU Kj by following a step-case 
function. For the job, yields 1 when the job is assigned to TZj and otherwise. Figure [s] illustrates 
Equation [3] for job assignment of T, to CPU TZi when A' = 11 and A[/,y] = 4. 

assigned 
instances of r,- 
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Figure 3: Job assignment sequence to %i. 



Thanks to these rules, it follows, depending on the value of parameter K, that a direct advantage of this cyclic job 
assignment is its ability to considerably reduces the number of task migrations. 

Example. In order to illustrate our claim, consider a platform comprised of 3 identical CPUs 7ti,7t2,7t3- We 
assume that K = 11. For a migrating task T;, let us consider that A[i, 1] = 4, A[i,2] = 2 and A[i,3] = 5, meaning 
that 4 jobs of T, are assigned to CPU 7ti, 2 jobs to CPU K2 and 5 jobs to CPU 713. The computations for defining 
the job assignment sub-sequences using Equation[3]are summarized in Table[T] 
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Table 1 : Most regular-job assignment sequence 



At step £ = 5 for instance, one job of T,- will be assigned to 7ti, then one job to 7t2, and no job will be assigned 
to 713. Whenever = (Vy), as this is case at steps 1, 7 and 10 in Table 1 then none of the jobs is assigned to 
none of the CPUs. Such a sub-sequence can be ignored while defining the cyclic assignment controller In this 



6 



example, the complete assignment sequence, that will be used to define the cyclic assignment controller for jobs 
of task T, is derived as follows. 

o = (ao,ai,...,aA:_i) 

= (71:1,71:2, ^3, ^1,^3- ^3,^1, 712, ^3, ^1,^3) 

While using this strategy, even though this algorithm considerably reduces the number of task migrations, we have 
a recursion problem since to determine the job-pattern we need actually to know the job-pattern. Indeed, in order 
to know that A [i, j] jobs of task T,- are assigned to CPU Kj, we need to perform a schedulability test. However, such 
a test needs the pattern of the multiframe tasks assigned to each CPU to be known beforehand. A solution to this 
recursion problem is to use a schedulability test which requires only the number of jobs. That is, the one based on 
a worst-case scenario, which necessarily introduces a lot of pessimism. 

The next job-pattern determination technique wiU address this drawback and consequently schedules a larger 
number of task systems. 

Alternative job-pattern algorithm 

This cyclic job assignment algorithm has been designed in order to overcome the drawback highlighted in the 
previous one (see algorithms [T]-^. It considers each CPU individually and compute the matrix A incrementally. 
Initially, A [i, 1] is initialized at K. If the obtained multiframe task is not schedulable upon 7ti , then A [i, 1] is decre- 
mented to K —I. The decrement is repeated until we get the largest number jobs of T,- which are schedulable upon 
711 . In the previous example, integers between 1 1 and 5 have been tested without success, but A[/, 1] = 4 succeeded, 
then the multiframe task which will be considered upon this CPU is xj — ((Ci,0,C,,0,0,C,,0,0,C,,0,0),Z),,7;) 
using Step 2 of the previous method (Equation[3]in particular). In addition, instead of directly considering a multi- 
frame task with K frames each time, this alternative algorithm allows us to temporarily consider multiframe tasks 
with a number of frames equal to the number of remaining jobs. 

In order to determine the multiframe task zj that will be executed upon CPU K2, we temporarily consider a 
multiframe task Zj^ with (/ = K — A[i, 1]) frames (corresponding to the number of remaining jobs for if). This 
temporal multiframe task x'j^ is built by using the Step 2 of the previous method (Equation [3] in particular), and 
then we determine its pattern by considering I'j^ ((OjC;, . . . ,O^Ci),Di, Ti) with o| G {0, 1}, 1 <£<J. After this 
operation has been performed, the multiframe task T? is provided with K frames by using z] and x^^, based on the 
following two rules: (/) If the /'' frame of xj is nonzero, then the frame of X? equals zero. (//) If the frame 
of T ■ is zero and corresponds to the q'^^ zero-frame of t' , then the /'' frame of T? is determined by using the value 
of the q''^ frame of x'j^. 

These rules can be generalized very easily. Suppose our aim is determining the multiframe task corresponding 
to T, upon CPU Kk. The steps to perform are as follows. We first determine the amount of remaining jobs 

j'^^K^'£lz\A[i,q]. Then, we compute thanks to Equationjsj by using J rather than K. Finally, we determine 
by using the pseudo-code of Algorithm[l]where xf (j) denotes the frame of xf . That is, if xf = ((C,, 0,C,),Z),, 7]), 
then xf (1) = Q, xf (2) = and xf (3) = Q. 

Example. Consider the same task x, as in the previous example. Assuming only 4 jobs can be assigned to 7ti, 
then the multiframe task x' upon %i is xj = ((C,-,0,C,,0,0,C,,0,0,C,,0,0), A,?^) by using Algorithm[l] If x' is 
schedulable upon 711, then we consider 712 and we repeat the same process. If not, we try to assign jobs to 7ti, now 
assuming only 3 jobs can be assigned this time. 

In this example, we assume that 4 jobs have successfully been assigned to Ki. We now assume that neither 4, nor 
3 jobs of X/ cannot be assigned to 7t2, but 2 jobs can. Computing the intermediate task x ■ using 7=11—4 = 7 leads 
us tox-2 = ((Q,0,0,C,,0,0,0),A,7^). By applying Algorithm[l]again, we obtain xf = ((0,C,,0,0,0,0,Ci, 0,0,0,0), A, 7])- 

We repeat the same process for CPU 713. When trying to assign the 5 remaining jobs, computing x, with 7 = 5 

gives x'.^ = {{Ci,Ci,Ci,Ci,Ci),Di, T^). Again, applying Algorithm[l] we obtain x^ = ((0,0,0,C,-,C;,0,0,C,-,0,C„C,), A, Tt). 
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Algorithm 1: Computation of 


[nput: K, /t, T= (T,!,...,i:f"''),T|* 


Output: Multiframe task T* 


Function Compute {'mxl' ,'mK,inz) 


begin 








q 


^ 1 






for (j 


= 1 ...K)Ao 






Free ■(— True; 






I 


^ 1; 






wliile <k and Free ) do 








if =Othen 








else Free False; 






if Free then 






















else T*(7) = 0; 




return x* ; 



Algorithm 2: Jobs assignment for migrating task T, 


Input: K, task T, 


Output: True and A if T, is schedulable. False otherwise 


Function Algo2 (inT,, inA^.outA) 


begin 










Remjobs •^ 


- K ;/* Remaining jobs*/ 




tor(k 


= 1- 


■ ■ m) do 






for a 


= Remjobs ■ ■ ■ 1 j do 








Compute T,* by using Equation 3 ; 








-c* = Compute(T;',/f, [x] , . . .,!,' ')); 








if f T* is schedulable on % j then 










A[i,k]^j- 










RemJobs RemJobs— ; 










Exit; 






if Remjobs = then return True ; 




_ return False 
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Algorithm 3: Semi-Partitioning Algorithm 

[nput: m, T = {ti , . . . ,T„}, parameter K. 

Output: True and A if T is schedulable. False otherwise. 

Function SemiPart (mm. 'mK ,mz,ovitA) 

begin 

for (i=l ...n) do 
Sched <r- False; 

/*We try to schedule the task using FFD */ 
for (j=l ...m)Ao 

if T, schedulable on %j then 

T, is assigned to Tty; 

Sched <r- True; 

if ("Sched == Falsej then 

/*Task Xi is a migrating task. We perform 
the job assignment among the CPXSs by 
calling Agorithm^ */ 
if Algo2(t;,-,/s:,A) == False then 

/*T, is not schedulable */ 

return False; 

return True; 



3.3 Semi-partitioning scheduling algorithm 

This section describes our semi-partitioning algorithm based on the concept of semi-partitioned scheduling. Like 
traditional partitioned scheduling algorithms, the schedulers have the same scheduling policy upon each CPU, that 
is EDF in this case, but each of them is not completely independent, because several CPUs may share a migrating 
task. The principle of the algorithm is as follow. 

• Sort the tasks in decreasing order of their utilization factor. 

• For each task considered individually: 

- Select one CPU and set it as the one with the smallest index. 

- If the current task is schedulable upon the selected CPU while using the FFD algorithm (i.e. all jobs 
meet their deadlines), then assign the task to this CPU. 

- If the current task is neither schedulable upon the selected CPU nor upon the others by using the FFD 
algorithm, then determine the number of jobs that can be scheduled upon the selected CPU w.r.t. an 
EDF scheduler Assign those jobs to the selected CPU by using the multiframe task approach defined 
earlier Then, move to the next CPU. Repeat this process until all the jobs of the task are assigned to a 
CPU. 

In the end, the intuitive idea behind our algorithm is to consider one task at each step and defines the minimum 
number of CPUs required to execute all its jobs (this takes at most m iterations, where m is the number of CPUs). 
At the beginning, the algorithm performs using an FFD algorithm. Then, whenever a task cannot be assigned to a 
particular CPU, its jobs are assigned to not fully utilized CPUs (at most m). Thus, the algorithm requires at most 
0{nm) schedulability tests. 

4 Schedulability analysis 

This section derives the schedulability conditions for our algorithm. Because each migrating task is modeled as a 
multiframe task on each CPU upon which it has a job assigned to, the schedulability analysis is performed on each 
CPU individually, using results from the uniprocessor scheduling theory. As tasks are scheduled upon each CPU 
according to an EDF scheduler, a specific analysis is necessary only for CPUs executing at least one multiframe 
task. For such a CPU, classical schedulability analysis approaches such as ''Processor Demand Analysis" cannot 
applied, unfortunately. This is due to migrating tasks. We consider two scenarios to circumvent this issue: the 



9 



t'(1) r;(2) r'(3) T.'Cl) t;(2) t;(3) r'(l) t'(2) t'(3) r,'(l) t,H2) 



M M t to tei t M M t 

or IT 2T 3T 4r 5T 6T 7T 8T 9T lOT 

I it 

ha KT KT cH I 

H KT H^t- ^^J KT^ 

Figure 4: Illustration of the multiframe task t' = ( (C, , 0, C, ) , i; , 7]) . 

packed scenario and the scenario with pattern and we develop a specific analysis based on an extension of the 
Demand Bound Function (DBF) ]3|4| to multiframe tasks. We recall that a natural idea to cope with each migrating 
task is to duplicate it as many time as the number of CPUs it has a job assigned to and consider a multiframe task 
upon each CPU. 

4.1 Packed scenario 

As each migrating task T,- is modeled as a collection of at most m multiframe tasks i\, T?, . . . , T™, where i\ is to be 
executed upon CPU lij (m is the number of CPUs), this scenario considers the worst-case. This occurs when, for 
each multiframe task associated to T,, all nonzero execution requirements are at the beginning of the frames and 

the remaining execution requirements are set to zero. That is is modeled as ((C;,C/, . . . ,Ci,0, . . . ,0),D;, 7]), 
where C, is the execution requirement of task x,-. (see Appendix for the proof). 

In order to define DBF at time t (that is, our adapted Demand Bound Function for systems such as those considered 
in this paper), we need to define the contribution of each task in the time interval [0,f). As K denote the number 
of execution requirements in a multiframe task associated to T,, let £^ denote the number of nonzero execution 
requirements in if. When using the packed scenario, can be modeled as if = ((C,-, . . . ,C,,0, . . . ,0),Di, Ti). The 

elements 

challenge is to take into account that only i'f jobs of taskxf will contribute to the DBF in the time interval [0, A" - 7]). 
Based on FigureUl the contribution to DBF at time f for the multiframe task is determine as follows. 



• First, consider the number of intervals of length K Tihy time f . Letting s denote that number, we have: 

def t 



KTj 



(4) 



As such, the contribution of z'l to the DBF is at least s-£i- Q. 

• Second, consider the time interval [s ■ K ■ Ti,t), where several jobs may have their deadlines before time t. 
Assuming that all the jobs are assigned to the considered CPU, the number of jobs ("a") is given by: 



def , „ 

a = max 0, 



(f mod K Ti)-Di 
Ti 



(5) 



Since at most I'l jobs over K will be executed upon that CPU, then the exact contribution of T, in interval 
[s-K -Ti.t) is given by: min(^^, a) • C,-. 



Putting all of these expressions together, the DBF of at time t is defined as follows. 

DBF(xf , f ) = s . . C; + min (^f , a) • Q (6) 
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Schedulability Test 1. Let x = {ti,T2, ■ ■ • jtn} be a set of n constrained-deadline sporadic tasks to be scheduled 
upon an identical multiprocessor platform. For each selected CPU 7i, if iA(^ C t denotes the subset of non-migrating 
tasks upon 7t, then a sufficient condition for the system to be schedulable upon 7i is given by 

^ DBF(Ty,f)+ ^ DBF(xf,0<f Vf (7) 

In Equation [t] ij is a non-migrating task, DBF is the classical Demand Bound Function and i\ is a multiframe 
task assigned to %. 



4.2 Scenario with pattern 

This scenario, in contrast to the packed one which considers the worst-case, takes the pattern of job assignment 
provided by our algorithm into account. Indeed, implementations showed that the schedulability analysis based 
on the packed scenario may be too pessimistic, unfortunately. 

The schedulability test developed in this section is similar to the one developed previously for packed scenarios in 
that it is also based on the DBF. However the pattern of job assignment to CPUs is taken into account. Therefore, 
the only noticeable difference comes from the second term of Equation |6] which is replaced by: 



max 

£:=0 



/c+nbi(t)-l \ 

52 CijmoAK ] (8) 

In Expression + 1 and denotes the number of jobs ofT*^ in the time interval [i-/r-7/,f). 

This is, we compute the processor demand by considering that the "critical instant" coincide with the first job of 
the pattern, then the second and so on, until the A'''' job. After that, we consider the maximum processor demand 
in order to capture the worst-case. Hence we obtain: 

DBF(T„f) =^-^f-C,+maxl 2. C,jm„,^ I (9) 




Schedulability Test 2. Let x = {Ti,T2, • • • it/i} be a set of n constrained-deadline sporadic tasks to be scheduled 
upon an identical multiprocessor platform. For each selected CPU 7t, the sufficient condition for the system to be 

schedulable upon 7i is as the previous one except that in Expressionlvjthe value of DBF is now replaced by DBF. 

Note that Lemma [Tjin the Appendix provides a proof of the ''dominance'' of Schedulability Test 2 over Schedula- 
bihty Test 1. That is, all systems that ai^e schedulable using the Test 1 are also schedulable using Test 2. 



5 Experimental results 

In this section, we report on the results of experiments conducted using the theoretical results presented in Sec- 
tions [3] and |4] These experiments help us to evaluate the performances our algorithms relative to both the FED and 
the S. Kato algorithms. Moreover, they help us to point out the influence of some parameters such as the value of 
parameter K and the number of CPUs in the platform. We performed a statistical analysis based on the following 

def 

characteristics: (/) the number of CPUs is chosen in the set M = {2,4,8, 16,32,64} from practical purpose. In- 
deed multicore platforms often have a number of cores which is a power of 2, (//) the system utihzation factor for 
each CPU varies between 0.50 and 0.95, using a step of 0.05. 

During the simulations, 10.000 runs have been performed for each configuration of the pair (number of CPUs in 
the platform, system utilization factor). In the figures displayed below, a "success ratio'' of 2% of an algorithm 
SI over an algorithm 'B means that leads to a schedulability ratio of y%, where 'B leads to a schedulabihty ratio 

iy~2)%. 



For comparison reasons, the same protocol as the one described in 1 12 1 by S. Kato for generating task systems has 
been considered. That is: 
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Figure 5: Success ratio relative to parameter K for packed scenarios 
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Figure 6: Success ratio relative to parameter K for scenarios with pattern 



1. the utilization factor of each task is randomly generated in the range [0, 1] with the constraint that the sum 
of utilization factors of all tasks must be equal to the utilization factor of the whole system, 

2. the period 7) of task T, is randomly chosen in the range [100, 3000], 

3. the relative deadline D,- of task T, is set equal to its period, 

def 

4. the WCET C, of task T, is computed from its period 7] and its utilization factor m, as C,- = m,- • 7]. 

It is worth noticing that we have no control of the number of tasks composing the system. This number depends 
on the utilization factors of the tasks. 

Figure [3] depicts the curve of the success ratio relative to parameter K for ''packed scenarios". As we can see, 
the lower the value of K, the higher the success ratio. This result may be surprising and counter-intuitive at first 
glance as we would expect the contrary, that is, increasing K would improve the success ratio. A reason to this is 
provided by the pessimism introduced when packing all the nonzero frames at the beginning of each multiframe 
task. Indeed in this case, the processor demand is over-estimated at time f = for each CPU upon which there is 
a multiframe task. To illustrate our claim, when K — 2, each migrating task is assigned to at most 2 CPUs, thus 
increasing the pessimism for these CPUs. Consequently, the larger the value of K, the larger the number of CPUs 
for which the processor demand will be over-estimated. 

Figure |6] depicts the curve of the success ratio relative to parameter K for the "scenario with pattern". Here, in 
contrast to the results obtained for "packed scenarios", the curves behave as we would intuitively expect them. 
That is, increasing the value of K leads to a higher success ratio. This is due to the fact that a higher value of K 
allows the jobs of a migrating task to be assigned to a larger set of CPUs, thus reducing the global computation 
requirement upon each CPU. 
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Figure 7: Comparison of FFD, packed- (K=2), pattern- (K=20) and S. Kato's algorithms 



Now, it is worth noticing that both the packed scenario and the scenario with pattern lead to the same algorithm 
when K ^2. In fact, while considering a scenario with pattern, the multiframe execution requirements which are 
allowed for a migrating task x,- are (C,,0) and (0,C,). Hence, from the schedulability analysis view point, the one 
that will be considered is (Ci,0), which corresponds to the packed scenario. 

Figure |7] compares the performances of the algorithms which provided the best results using our approach during 
the simulations, that is, the one using packed scenarios when K = 2 and the one using scenarios with pattern when 
K — 20, and those of both the FFD and the S. Kato algorithms. 

We can see that our algorithms always outperform the FFD algorithm and are a bit behind the one proposed by S. 
Kato in terms of success ratio. However, the results provided by our algorithms strongly challenge those produced 
by the S. Kato one for large number of CPUs. Note that our algorithms in contrast to the S. Kato one do not allow 
job migrations, thus limiting runtime overheads which may be prohibitive for the system. Hence, we believe that 
our approach is a promising path to go for more competitive algorithms and for practical use. 



6 Conclusion and Future work 

In this paper, the scheduling problem of hard real-time systems comprised of constrained-deadline sporadic tasks 
upon identical multiprocessor platforms is studied. A new algorithm and a scheduling paradigm based on the con- 
cept of semi-partitioned scheduling with restricted migrations has been presented together with its schedulability 
analysis. The effectiveness of our algorithm has been validated by several sets of simulations, showing that it 
strongly challenges the performances of the one proposed by S. Kato. Future work will address two issues. The 
first issue is relaxing the constraint on parameter K as we think that is possible to define a value K for each task 
such that the number of frames are kept as small as possible. The second issue is solving the optimization problem 
taking into account the number of migrations. 

Acknowledgment. The authors would like to thank Ahmed Rahni from LISI/ENSMA for his insightful comments 



13 



on the schedulability tests of multiframe tasks. 
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A Appendix 

dcf 

Let T; be a multiframe task with a pattern o = (oi , . . . ,0/f), that is, T; = ((oiC;, . . . ,0/s:C,),Z),-, 7]) with Oj G 

{0, 1}, 1 < 7 < ^. Let denotes the number of nonzero execution requirements. Let x|''^ be the packed version of 
X, that is: Oj = 1 if j < and Oj = 0, otherwise. 

Lemma 1 

DBF(Xi,?)<DBF(x|^^0 (10) 
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J=C 



Proof By definition: 

= K-\ ^ 

DBF(Xi, = i • Q + max ( £ mod k 
In this Equation, the max term can be rewritten as: 

nbi(t)-\ \ ^ 

C, J mod /f j = max ( C, }^ Oij-modiT 1 (11) 

Note that since Oj e {0, 1} and we perform the sum of nbi{t) terms. 

c+nbi (r)-l 

Oy mod^ <niin(^,-,max(0,nfo,(;))) (12) 

Because we have at most ^, nonzero execution requirements, so, we take the minimum between and nbiit). As 
such, 

max y C, irnod*: < max(C,min(£,-,max(0,n^)i(f)))) (13) 

leading to 

A+nfc,(f)-l \ 

QjmodA: 1 < Cimin(^,,max(0,nfo,(())) (14) 



Finally, DBF(Xi,0 A - C, +Qmin(^i,max(0,nfoi(0)). Thatis, DBF(t,,?) < DBF(x[^\0- The lemma follows. 



Theorem 1 Let x, he a multiframe task. The worst-case pattern for x, is the packed pattern. 

Proof A pattern ^ is said worse than a pattern 'B if and only if schedulable implies (B also schedulable. If the 

packed pattern is schedulable and thanks to the previous lemma, then DBF(Xj,/) < DBF(xf ,f) < /,Vf, that is, the 
pattern Z is schedulable, and thus, the packed pattern is worse than Z. Since Z represent any pattern, the packed 
pattern is the worst-case pattern. The theorem follows. ■ 
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